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Should a test taker change his initial 'response to a multiple choice 
item? .^e changed responses more likely to be correct than are initial 
responses? According to conventional wisdom, the first response is the best 
response. For .,exan?)le. Jacobs (1972) asked students to suinmarize their " ' 
answer-changing experience. A plurality felt they had lost points, with 
about as many students having nq opinion as estimating a gain. AlTo, a - 
Class of graduate students taught by' one. of the authors strongly advised 
against answer changing. Lynch and Smith (1975) quoted several published ' - 
sources who advise against changing answers, and also surveyed students with •- 
similar^results: over'^5% of the students estima ted t hat chan ging answ.r. 
-woul^ tend to "lower their scores. * 



In contrast, empirical evidence has consistently shown increments due 
to changing initial responses (e.g.. Lynch § Smith. 1975; Matthews. 1929; 
Mehrens § Lehman. 1973. p. 317; Reiling § Taylor. 1972; Smith § Moore. 1976). 
Further, several researchers compared answer-changing behavior Lnong, sub- 
groups of their total samples. For exa^le. subgroups were defined according 
. tp total test- score, course grades. GPA. or a ^Standardized cognitive test. 
Generally th^ better^students tended to gain more* than did the poorer 
^ students (Archer iVippert. 1962; Bath. 196^; Copeland. 1972; Mueller § 

!>■ : ' : 

A paper presented at the convention of the National Council on Measurement 
m Education, San Francisco. April 1976. Measurement 
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Shwedel, 1975; Reile § BriggS, 1952), although no differenqes were found by ^ 
-Jacobs (1972) and Pascale (19"74) . Several o^ these researchers also invftsti- 
(gated sex differences with very, mixed results. Yet even though some groups 
vgained more than others, the Average for -each of these subgroups . was an 
' increment. * 

Based on empirical evidence, then", the test, taker should expect a gain 
in score if he reconsiders an item, concludes that his response should be * 
changed, and then makes the change. Such }l discrepancy in advice vs. / 
evidence suggests a need, for further investigation using a different approach. 
Perhaps if subgroups are defined quite differently, answer-changing behavior 
/Would differ. "It is possible, for instance, that certain personality > 
variables such as impulsivity or anxiety ... may correlate with the inci- 
dence and ^fectiveness of answer-changing behavior" (Muel^Lejr 5 Shwedel, 1975). 
Th e following ques tions gui ded thi s i nvestigation: 1) To what extent 



do t3=st takers change answers? 2) ' Does* changing answers tend to result in 
higher scores? Defiping answer- changing behavi^or as number of changes and 
increment .diie to changing, 3) Does answer-changing behavior relate to cogni- 
tive styles and to attitudes? 4) answer- changing behavior related to 
academic performance? 

, Sainple 

Students from three universities enrolled in an educational measurement 
and evaluation. course constitute the sample. The four classes numbered 43'; 
24, 17,. and 50 students. ' Two of the classes were taught by one professor, 
two by another professor. The majority cf the students i^ere inservice or 
preservice teachers. Prior to all data being collected, the professors did 
not mention the appropriateness of changing answers, nor the purpose of the 
^ study. 



EKLC 



Item Response Chang^es 
3 

instrumentation 

. - The dependent variables are the number of changed item responses and 
the increment in score ffom.changing responses on classroom tests in the 
educational measurement and evaluation course. F6r two ^e classes the 
quizzes totalled 54 and .75 multiple choice it^is, feost of which had' four 
alternatives. For the other two classes the quizzes totalled 132 and 135 , 
^/ items, most of whi^h, were multiple choice^ with a small p^rcentagel^true- 
. false items. The data were collected unobtrusively, wiih no special 

directions given to the respondents. TVo judges examined each, answer ' - . 
, sheet and agreed on 'the changes made by the respondent. When an item 
response was changed, the changes were recorded in one of the following 
categories: . 1) wrong response changed to right response, 2) wrong response ' 
changed to another wroi^ response, and 3) right response xli an ged to wrong 
response. ^ • 



The independent variables include measures of four cognitive styles, 
plus self-report grades, attitudes towards the course and towards tests, 
course performance^ and sex. The cognitive styles and the major measure 
used for each are: , ? ' 

impulsivity-reflectivity, measured by Barran's Personal Evaluation ■ 

^ ft- 

' (Barratt); 

anxiety, measured by Spielberger, Gorsuch, and Lusheme's Self- ^ 
^ Evaluation Questionnaire (1968); \ 

. preference for complexity, measured by Barron's (1963) Complexity 
Scal6; and * D . \ 

„ - field .independence, measured by t^e Hidden Figures Test^ (Educational 
Testing Service, 1962) . " , 

Several variables were assessed using a three-page questionnaire which 
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contained a -,rUty of item ty^%. . Brief questions sollcited'both under! ' 
graduate a, ,duate GPAs and a report; of previous course work in Measure. - 
^ment. Ukert .te»s allowed specification of attitudes .toward the. course ' 
»d toward the quizzes. A se«u,tic differential item specified attitudes 
toward testing using both the evaluation and potency dimensions. Several 
. \__^'"in8 sca^s were constructed to measure cognitive styles. 

Cfifee performance was assessed both as tota} score on the quizzes and 
^ letter grade for the course. Because of the different numbers of items for . 
each Class, quiz score, nmnber of changes, and. increment from ch'anging we're ^ " 
converted to standard scores within class "for co.^t ing total gro,^^orreliS?,s 
^ " ' Results 

^ All students changed at;^ least one answer. With th, mean number of, changes 
.ranging fro. 5.0 to^8.6 depending on the class (see^able J^X Although one 

^ point. Most people gained from chan^^ing. the percentage of students gaining 
. - ranged from 71% to 91% for each of the four classes, and the mean ana median 

increment tended to be approximately three-points for each class. ' 

Per^formance on the cognitive style variables is described in Table 2. 
Inspection of this table su^ests that Group 3 -s perfprmande 'ofteii d x>fer.d 
from the others; however.^ it was the smallest sample. ' 

The best predictor 'of gain from changing; answers was the number of items 
, 6n Which answers were changed. As shown in Table 3, the correlati;. coeffic 
ients ranged from .76 to .80. (Specifically, the coefficient for Group 1 was 
.80. tor Group 2 it was .78. Group 3 and Group 4 were ea'ch . 7^.^ and the total 
group was .78.) * • . 

Tke relationships of cognitive style measures with answer changing behav- ; 
iors were less consistent than was the relationship betweert. the two ways of " 

> . ^" , \ ^ ,^ . . . ^ . • 
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describing answer changing behaviors. For the first group those .who changed 
and profitM fro. changing te»ied to be low anxious,, for Group 2 i^ulsive ' 
for Ctoup 3 anxious, impulsive, and orderly,, for Group 4 non-co^ulsive; and 
for th^ total group i.,ulsive. (While the coefficients used in support of ' 
the abo-ve statement tend to be Just above or Just below the value needed ... ' 
significance at the .05 level, -the extent of inconsistency within cells of 
the first two rows of Table 3, plus the number of near zero correlations 
suggests caution in inferring relationships between answer chiTging behavior 
and cognitive style constructs.) 

■ Several cognitive style vkriables showed greater consistency with each 
.other, however. Por example, anxiety measured using the Spielberger instrument 
.correlated appropriately with. Questionnaire items, e.g., positively with anxiety 
and negatively with happiness and. extroversion.. THe field independents tended ' 

■ to consider themselves low anxious and happy ■ ihot. hi.i, 

oiiu iiiippy. inose high m impulsivity tended 

„to„be_seltrated,as-compul-slv.-,-extroverted, and cisu.1. Frequently for 
these relationships wittin the cognitive style.variables, as well as for 
other re'lationshipsvthe inconsistent group-„as the' third (and smallest), one. ' 
• Attitudes toward tests generally, -towart the course, and toward the 
quizzes, in. the course tended to be minimall^ related ^o answer changing 

■behavior. .Semantic differential ratings.of tests grouped »der evaluation 

and potency dimensions\orrelated between ..16.a«d -.18 with answer changing 

behavior for the total group. 

While females in the fourth group changed more answers and, profited . ^ 
mor^ fr.m thanges (rs = .24 and .31), the correlations for the total group • 
were only .06 Wd .14. ' 

^^demio performance "in general and for th'is course iii particular was ' ' 
also-in^f^ective in estimating ^nswer changing behavior, as may b^ see^-in ^ 
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Table 4. The general academic performance variables are based on self 
report; the extent of bias due to self report is likely trivial (see,, e.g., 
McMorris ^ Aabrosino, 1973). • ' . 

^ Discussion 
me consistency of results among studies of answer changing remains 
unblemished: test takers change answers, and when they do, the tendency 
is to gain rather than either to lose or to remain at the same score. 

The search for cognitive style, attitude, and academic performance 
variables to correlate with answer-changing behavior was attempted. Most 
relationships were quite low, especially when all four classes w"^ combined. 
- Judging from these data, impulsivity might be the ..03t likely of ^e present 
cognitive style and" attitude variables to show relationships , with number of . 
, changes and increment from changing in a replication. As noted earlier, 
several researchers have found that academic performance is positively 
related to answer-changing behavior, although our no difference findings 
"^are not 'unique. ^ « » 

One reason for low relationships with answer. changing is, the limited 
reliability of the answer changing measures. For the fourth '(and largest)- 
cAass we iiitercorrelated the number of changes between pairs of the three' 
quizzes, averaged usi^g the z transformation, and corr^ted the reconverted 
average using the Spearman- Brown formula; We repeated the process, for the 
increment. The reliability estimates w^re .76 and .21, respectively. . ^"^ 

Why. do test givers caution test takers not to change answers? One 
reason may be that changes from right to wrong are better -rememfeered thin 
are changes from wrong to right. Students going over' a corrected test are ' 
more likely to be wondering why they got certain iteiis wrong and not why . 
they got other items right. ' They are thus more likely to. notice right to 
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wrong changes and* be convinced that changing responses tends to hurt 'their 
test scores (Lynch § Smith, 197S, p. 223). Perhaps we humans expect a gain 
and can't* tolerate a loss,. and recall the losses more vividly. 

Another reason for the caution may have, to do with how test givers 
conceptualize memory. For example, memory may consist of ideas or concepts 
that represent copies of sensory experiences. Remembering or recall may 
require' the arousal. of a» sequence of neural connections that have stored 
these copies of "reality." Utilizing this theoretical perspective, an 
associative network is presumed to be responsible for relating stimuli and 
responses. Xn'test 'taking situatipns, the tester assumes that the first 
presentation of the stimulus (the multiple choice question) will most likely 
evoke the response (correct answer option); later evaluation and reconsider- 
ation of the question might disrupt the original memory trace. Thus, the ^ 
test taker is discouraged from altering his initial response. 

A review. of pertinent researph findings indicates that changing 
answers generally augments the overall performance of the test taker. An 
explanation for this consistent empirical finding might be to consider 
memory as an active .process that synthesizes* stored informatiqn to create 
a new construction. Jenkins (1974) suggests that this theoretical position, 
^^ch he refers- to as contextualism, is more useful in conceptualizing human 
memory processes than is the traditional associsttionistic model. This . 
theoretical perspective suggests that remembering is an active process of 
elaboration that takes time. The reevaluation of questions posed during an 
examination session could lead^ to an effective reconstruction of the concepts 
required in answering these questions. The additional experience gained' 
through exposure to related topics during the examination session should 
provide information that will augment the development of pertinent concepts 
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• to be used during the review of answer options. .The test taUr will^ use 
this additional information to choose th^ best option av^ilabit -changing 
answers-, based on a constructive memory process, would be expected t^ 
augment over'all performance. < " ' . 

^uch a view of memory is supported hy Lynch and Smith- (197^, p. 223). - 

The-- author^ hypothesize that students taking exams 
have th^r memories jogged by other items or other 
meroori^'s so that reconsideration after having 
answered other items makes students more able to 
, reason the correct answer. Students going over I 
a corrected test are more likely to be T^ondering 
^ .why they got certain items wrong and not why they 
got other items right.- They are thus more likely 
to notice right to wrong changes Md be convinced 
that changing resjgpnses tends to hurt their test 
^ scores. Thus, many students do not change 

responses. . - 

The consistency^of the research findings suggests that countless numbers 
of test takers may be penalized by inappropriate advice. Yet, the performance 

• of test takers suggests 'thafthis advice is either' ignored or conditionally 
interpreted, i.e., individuals do change answers on multiple choice tests. 

But should bur advice be to change answers? (Advice to change is typically - 

• given i,^ the discussion or -conclusion section of studies' on answer-changing 
behavior. X. Why should we question t^e application of such a consistent ' 
research finding? TVo concerns of ours are specified below. ^Tlie unobtrusive 
nature of the data collection procedure used here provides a naturalistic 

^approach but does not permit unambiguous interpretation of a student's rationale 
- for changing responses. Using a^contrived experimental procedure," Jacobs (1972) 
has demonstrated results similar' to ours, bi^ even with his additional controls 
the reasons for the changes Ve unclear. For example, what percentage of . 
changes are clerical corrections? What percentage are due' to clues from other 
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itei^? What percentage are due tg additional thinking. about the itemf 

What are the effects of attempting to modify answer changing strategies? 
instructions cc,uld De varied alo^ change-do-not-change continuum. TT.is 
^ approach could be use^l in evaluating the effects of a^ „K>dified strategy in 
terms of both number of changes and increment in test perfort«nce. ' If. test ^ 
takers are ' encouraged to change a, higher- percentage of responses, would the , 
increment from answer changing documented " in the research literature remain ^ 
an increment? " 
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Table 1 



Nujd)er of Answer Changes and Increment from Changing 

for Each of Four Groups / 



> Number of Answer Changes Increment from' Changin g 

.1-2 3 4 - 1 ■ 2 ' 3 4 



Statistic ^ 





- Mean..,' 
Median 
ModeCs> 
Standard Dey^ 
Low-Hi^ 



5.0 8.6 5.6 6.2 

5' V 6/5 5 . 5 

'1,6 X - 3 

2.8 7.1' f 3.,2- 5.1' 



3.0- 2.8 



1.9 3.4 



3 
3 

2.1 



3 
3 
2.6 



2 
1.9 



3 
5 

2;6 



1-12 1-35/- 1-11 1-27 (-l)-8. (-l)-9 (-l)-5 (-3^-10 



R'iercent of n Gaining " 
/Perteivt) of n Losing 

Number of Items , ^ 74 135 54 132 



Number of Testees 



43 24\, 17 



50 



91 
2 



79 
12 



71 
12 



90 
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Table 2 

Means and Standard Deviations for 
^Cognitive Style Variables 
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Means v 



Group 



Standard Deviat^ions 
Group 



variaDie *^ 


.1. 


2 


3 


■ . 4 


Total 




1 


2 ■ 


3 


4 


Total 


Anxiety: 






















r 


State 


37.7 


•■34-C'6 


36.6 


36.1 


36.4 


9 


.5 


8.2 


6.f 


/II. 4 


■ 9.7 


Trait ' 


38.8 


36.9 


42.8 


36.7 


38.0 


g 






7 

• Q C 


y.o 


9.1 


Total ' . ' 


76.4 


71.5 


79.3 


72.8 


74.4 


15 


.2 


15.1 


12.4 


19.6 


16.8 


mxious-Calm 


^ 3.4 


3.7 


4.1 


3.4 


3.5 


1 


.6 


i.6 


1.7 


1.8 


1.7 


/ 

Happy-Unhappy 




5.7 


5.0 


5.3 


5.4 


1, 


.1 


1.0 


.9 


1.5 


1.2 


Fxold IndeDendencp 


mJ • O 


ft 7 




10. 1 


9.3 


4, 


■8, 


5.0 


3.9 


6.1 


5.4 


Prctfprence for 
























Complexity - 


23.4 


23.2 


21.1 


23.3 


23.1 


5. 


9 


5.2 


4.6 


6.2 


/ 
5.7 


Inpulsivity 


106.4 


109.3 


104.5 


107.1 


107.1 


12. 


4 


16.2 


11.8 . 


12.4 


13.1- 


Compulsive- 






















Nonco]q)ulsive 


3.9 


4.Q 


4.7 


4.0 ■ 


.. 4.0 


1. 


6 


1.8' 


1.3 


1.6 


i.6 


Extrovert- 
























Introvert 


4.5 


4.7 


,4.1 


4.6 


4.5. 




5 


1.7 


1.4 


1.5- 


1.5 


Orderly-Casual 


4.2 


3.9 


5.0. 


, 4.5 


4.3 


1. 


7 


1.5 


1.7 


1.6 


1.6 


\ 

n 


43 


24 


17 


50 


134 















The bipolar adjectives were incitided on the questionnaire. Maximum agreement 
' „with the word on the left would yield a score of seven 
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• Table 4 

Correlations of Answer Cniangijig Behaviors 

with Academic PerfoMiance 
Four Groups and Total (Decimals Omitted) 



Academic Performance 



Answer Changing 



Number of Changes' 



In General : ^ 



Increment 



Undergraduate GPA 

Previous Testing Course (yes-no) 

Nunber of Graduate Courses^b 

Gradxiate GPA 
This Course: 



-06 -09 
-36 . -01 
-07 

-20 -08 
44 09 

. 02 



27 
30 



-12 
00 



05 



13 02 
05 20 
' 11 



-01 -03 

-38 07 
-01 

-12 -05 

35 \ -09 

43 -07 

30 15 



13 13 
-10 26 
14 



Total Quiz Score 



Letter Grade 



21 
-09 



-07 
-08 



-02 



11 -27 
■21 -08 
-11 



16 03 

05 " 08 
08 

05 -12 
-26 02 I 

-03 / 



Note, Coefficients for the four classes and the total group are given in 
order; nS- = 43. 24, 17, 50 and 134, respectively ^ 

"In General" variables were included on the questionnaire. ' 
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